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Abstract. Contextuality is a key feature of quantum mechanics that provides an important non- 
classical resource for quantum information and computation. Abramsky and Brandenburger used 
sheaf theory to give a general treatment of contextuality in quantum theory [New Journal of Physics 
13 (2011) 113036). However, contextual phenomena are found in other fields as well, for example 
database theory. In this paper, we shall develop this unified view of contextuality. We provide two 
main contributions: firstly, we expose a remarkable connection between contexuality and logical 
paradoxes; secondly, we show that an important class of contextuality arguments has a topological 
origin. More specifically, we show that “All-vs-Nothing” proofs of contextuality are witnessed by 
cohomological obstructions. 


1. Introduction 

Contextuality is one of the key characteristic features of quantum mechanics. It has been argued 
that it provides the “magic” ingredient enabling quantum computation [25]. There have been a 
number of recent experimental verifications that Nature does indeed exhibit this highly non-classical 
form of behaviour |3|26l|52l|53] . 

The study of quantum contextuality has largely been carried out in a concrete, example-driven 
fashion, which makes it appear highly specific to quantum mechanics. Recent work by the present 
authors mES] and others [laiiziiini has exposed the general mathematical structure of contextu- 
ality, enabling more general and systematic results. It has also made apparent that contextuality 
is a general and indeed pervasive phenomenon, which can be found in many areas of classical 
computation, such as databases [T] and constraints [3]. The work in [2] makes extensive use of 
methods developed within the logic and semantics of computation. 

The key idea from [2| is to understand contextuality as arising where we have a family of data 
which is locally consistent, bnt globally inconsistent. This can be understood, and very 
effectively visualised (see Fig. Ej) in topological terms: we have a base space of contexts (typically 
sets of variables which can be jointly measured or observed), a space of data or observations fibred 
over this space, and a family of local sections (typically valuations of the variables in the context) 
in these fibres. This data is consistent locally, but not globally: there is no global section defined 
on all the variables which reconciles all the local data. In topological language, we can say that the 
space is “twisted”, and hence provides an obstruction to forming a global section. 

This provides a unifying description of a number of phenomena which at first sight seem very 
different: 

• Quantum contextuality. The local data arises from performing measurements on compatible 
sets of observables. The fact that there is no global section corresponds to a no-go result 
for a hidden-variable theory to explain the observable data. 

• Databases. The local data are the relation tables of the database. The fact that there is no 
global section corresponds to the failure in general of the universal relation assumption 
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• Constraint satisfaction. The local data corresponds to the constraints, defined on subsets 
of the variables. The fact that there is no global section corresponds to the non-existence 
of a solution for the CSP. 

In the present paper, we shall develop this unified viewpoint to give a logical perspective on 
contextuality. In particular, we shall look at contextuality in relation to logical paradoxes: 

• We find a direct connection between the structure of quantum contextuality and classic 
semantic paradoxes such as “Liar cycles” [MHMIEQIEI]. There are also close connections to 
the study of circuits with feedback m- 

• Conversely, contextuality offers a novel perspective on these paradoxes. Contradictory 
cycles of references give rise to exactly the form of local consistency and global inconsistency 
we find in contextuality. 

Mathematical structure. Sheaf theory [32] provides the natural mathematical setting for our 
analysis, since it is directly concerned with the passage from local to global. In this setting, it is 
furthermore natural to use sheaf cohomology to characterise contextuality. Cohomology is one 
of the major tools of modern mathematics, which has until now been conspicuous by its absence, 
both in theoretical computer science, and in quantum information. The use of cohomology to 
characterise contextuality was initiated in [5]. In the present paper, we take the cohomological 
approach considerably further, taking advantage of situations in which the outcomes of observations 
have an algebraic structure. This applies, for example, in the case of the standard Pauli spin 
observables, which have eigenvalues in Z 2 . 

We study a strong form of contextuality arising from so-called “All-vs-Nothing” arguments m- 
We give a much more general formulation of such arguments than has appeared previously, in 
terms of local consistency and global inconsistency of systems of linear equations. We also show 
how an extensive class of examples of such arguments arises in the stabiliser fragment of quantum 
mechanics, which plays an important role in quantum error correction |l2] and measurement-based 
quantum computation m- 

We then show how all such All-vs-Nothing arguments are witnessed by the cohomological ob¬ 
struction to the extension of local sections to global ones previously studied in [5| . This obstruction 
is characterised in more abstract terms than previously, using the connecting homomorphism of the 
long exact sequence. Our main theorem establishes a hierarchy of properties of probability models, 
relating their algebraic, logical and topological structures. 

Outline. The further structure of the paper is as follows. In ^ we show how our topological 
approach provides a unified description of many apparently different phenomena, linking contex¬ 
tuality, paradoxes and the “logical twisting” of spaces, leading to locally consistent data with 
obstructions to global consistency. This approach is then formalised in ^ In ^ we introduce 
All-vs-Nothing arguments and show how they arise concretely in stabiliser states. We then give 
our general formulation of these arguments as global logical inconsistencies. In ^ we develop the 
formal language of sheaf cohomology and apply it to provide cohomological witnesses of contextu¬ 
ality. Finally, in ^ we prove our main result, which relates the non-vanishing of the cohomology 
invariant to global logical inconsistency. 

Acknowledgements. Discussions with Alexandru Baltag and Louis Narens provided an impor¬ 
tant stimulus to studying the connections between contextuality and foundational topics in logic. 
We also thank Nicholas Teh for many useful discussions. Support from the following is gratefully 
acknowledged: John Templeton Foundation, AFOSR, EPSRC, and the Oxford Martin School. 

2. The many faces of contextuality 

We begin with the following scenario, depicted in Fig. [TJ Alice and Bob are agents positioned at 
nodes of a network. Alice can access local bit registers ai and 02 , while Bob can access local bit 
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Figure 1. Alice and Bob look at bits 
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Figure 2. The Bell table 


registers bi, 62 - Alice can load one of her bit registers into a processing unit, and test whether it 
is 0 or 1. Bob can perform the same operations with respect to his bit registers. They send the 
outcomes of these operations to a common target, which keeps a record of the joint outcomes. 

We now suppose that Alice and Bob perform repeated rounds of these operations. On different 
rounds, they may make different choices of which bit registers to access, and they may observe 
different outcomes for a given choice of register. The target can compile statistics for this series of 
data, and infer probability distributions on the outcomes. 

The probability table in Fig. [2] (ignore the highlights for the moment) records the outcome of such 
a process. Each row of the table specifies a probability distribution on the possible joint outcomes, 
conditioned on the indicated choice of bit registers by Alice and Bob. Consider for example the 
cell at row 2, column 3 of the table. This corresponds to: 

• Alice loads register ai and observes the value 0. 

• Bob loads register 62 and observes the value 1. 

This event has the probability i/s, conditioned on Alice’s choice of ai and Bob’s choice of 62 - 
We can now ask: How can such an observational scenario be realised? The obvious classical 
mechanism we can propose to explain these observations is depicted in Fig. [3i We postulate a 
source which on each round chooses values for each of the registers ai, 02 , bi, b 2 , and loads each 
register with the chosen value. Alice and Bob will then observe the values which have been loaded 
by the source. We can suppose that this source is itself randomised, and chooses the values for the 
registers according to some probability distribution P on the set of 2“^ possible assignments. Then 
the question becomes: is there any distribution P which would give rise to the table specified in 
Fig.[2p 
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Figure 3. A Source 


A key observation is that, in order for this question to be non-trivial, we must assume that the 
choices of bit registers made by Alice and Bob are independent of the source0 If the source could 
determine which registers are to be loaded on each round, as well as their values, then it becomes 
a trivial matter to achieve any given probability distribution on the joint outcomes. 

Under this assumption of independence, it becomes natural to think of this scenario as a kind 
of correlation game. The aim of the source is to achieve as high a degree of correlation between 
the outcomes of Alice and Bob as possible, regardless of their choices on each round. 

2.1. Logic rings a Bell. We shall now make an apparently innocuous deduction in elementary 
logic and probability theory. 

Suppose we have propositional formulas (fi, ■ ■ ■ ,(pN, and that we can assign a probability pi to 
each Pi. In particular, we have in the mind the situation where the boolean variables appearing in 
Pi correspond to empirically testable quantities, such as the values of bit registers in our scenario; 
Pi then expresses a condition on the outcomes of an experiment involving these quantities. The 
probabilities pi are obtained from the statistics of these experiments. 

Now suppose that these formulas are not simultaneously satisfiable. Then (e.g.) 

N-l N-1 

l\ (j)i ^ ^4 >n , or equivalently 4>n ^ \/ ■ 

i=l i=l 

Using elementary probability theory, we can calculate: 

N-l N-l N-l 

PN < Prob( \/ ^(pi) < ^ Prob(^(/.i) = ^(l-p*). 

i=l 2=1 2=1 

The first inequality is the monotonicity of probability, and the second is sub-additivity. Hence we 
obtain the inequality 

N 

< iV -1 . 

2=1 

We shall refer to this as a logical Bell inequality. Note that it hinges on a purely logical 
consistency condition. 


e.g 


This translates formally into a conditional independence assumption, which we shall not spell out here; see 
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Figure 4. The Hardy paradox 

2.2. Logical analysis of the Bell table. We return to the probability table from Fig. [2j If we 
read 0 as true and 1 as false, the highlighted events (sets of assignments) in each row of the table 
are represented by the following propositions: 


Pi = 

(oi A 61) 

V (-lOi A -161) 

— Oi "fA 

61 

P 2 = 

(oi A 62) 

V (-lOi A -'62) 

= Oi •H■ 

62 

P 3 = 

(02 A 61) 

V (-'O2 A -ifoi) 

= 02 AA 

bi 

Pi = 

(-■02 A 62) V (02 A -'62) 

= O2 AA 

-62 


The events on the first three rows are the correlated outcomes; the fourth is anticorrelated. These 
propositions are easily seen to be jointly unsatisfiable. It follows that our logical Bell inequality 
should apply, yielding the inequality Yli=iPi ^ 3 . However, we see from the table that pi = 1 
and Pi = 6/8 for i = 2,3, 4. Hence the table yields a violation of the Bell inequality by 1 / 4 . 

This rules out the possibility of giving an explanation for the observational behaviour described by 
the table in terms of a classical source. We might then conclude that such behaviour simply cannot 
be realised. However, in the presence of quantum resources, and in particular entangled quantnm 
states [42], this is no longer the case. More specifically, if we use the Bell state (|j't) + ||4))A/2) with 
Alice and Bob performing 1-qubit local measurements corresponding to directions in the XT-plane 
of the Bloch sphere, at relative angle ’^/s, then this behaviour is physically realisable according 
to the predictions of quantum mechanics—our most highly-confirmed physical theory^ 

More broadly, we can say that this shows that quantum mechanics predicts correlations which 
exceed those which can be achieved by any classical mechanism. This is the content of Bell’s 
theorem [9|, a famous result in the foundations of quantum mechanics, and in many ways the 
starting point for the whole field of quantum information. Moreover, these predictions have been 
confirmed by many experiments mm- 

The logical Bell inequality we used to derive this result is taken from |1|. Bell inequalities are a 
central technique in quantum information. In [3] it is shown that every Bell inequality (i.e. every 
inequality satisfied by the “local polytope”) is equivalent to a logical Bell inequality, based on 
purely logical consistency conditions. 

2.3. Logical forms of contextuality. We shall now see how the same phenomena manifest them¬ 
selves in a stronger form, which highlights a direct connection with logic. 

Consider the table in Fig. |41 which depicts the same kind of scenario we considered previously. 
However, the entries are now either 0 or 1. The idea is that a 1 entry represents a positive probabil¬ 
ity. Thus we are distinguishing only between possible (positive probability) and impossible (zero 
probability). In other words, the rows correspond to the supports of some (otherwise unspecified) 
probability distributions. Note that only four entries of the table are filled in. Our claim is that 

^Here, the choice of measurement directions by Alice and Bob plays the role of the “choice of local bit registers to 
load” in our original description. 
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just from these entries, referring only to the supports, we can deduce that there is no classical 
explanation for the behaviour recorded in the table. Moreover, this behaviour can be realised in 
quantum mechanics [241141] . yielding a stronger form of Bell’s theorem, due to Hardy |24j . 

What do “observables” observe? Classically, we would take the view that physical observables di¬ 
rectly reflect properties of the physical system we are observing. These are objective properties 
of the system, which are independent of our choice of which measurements to perform, i.e. of 
our measurement context. More precisely, this would say that for each possible state of the 
system, there is a function A which for each measurement m specifies an outcome A(m), inde¬ 
pendently of which other measurements may be performed. This point of view is called 
non-contextuality, and may seem self-evident. However, this view is impossible to sustain in 
the light of our actual observations of (micro)-physical reality. 

Consider once again the Hardy table depicted in Fig. 01 Suppose there is a function A which 
accounts for the possibility of Alice observing value 0 for ai and Bob observing 0 for 6 i, as asserted 
by the entry in the top left position in the table. Then this function A must satisfy 

A *• I —y 0, I —y 0 . 

Now consider the value of A at 62 . If A( 62 ) = 0, then this would imply that the event that ai has 
value 0 and 62 has value 0 is possible. However, this is precluded by the 0 entry in the table for 
this event. The only other possibility is that X(b 2 ) = 1. Reasoning similarly with respect to the 
joint values of 02 and b 2 , we conclude, using the bottom right entry in the table, that we must have 
A(a 2 ) = 0. Thus the only possibility for A consistent with these entries is 

A L a\ I— y 0, a 2 '— y 0, b\ 1 — y 0, 62 *— y 1 . 

But this would require the outcome (0,0) for measurements ( 02 , ^ 1 ) to be possible, and this is 
precluded by the table. 

It is easily seen that this argument amounts to showing that the following formulas are not jointly 
satisfiable: 

aiAbi, -'(aiA62), “'(02 A 61), 02 V 62 • 

We are thus forced to conclude that the Hardy models are contextual. Moreover, we can say 
that they are contextual in a logical sense, stronger than the probabilistic form we saw with the 
Bell tables, since we only needed information about possibilities to infer the contextuality of this 
behaviour. 

Strong contextuality. Logical contextuality as exhibited by the Hardy paradox can be expressed in 
the following form: there is a local assignment (in the Hardy case, the assignment oi 1 —?■ 0, 61 i-A 0) 
which is in the support, but which cannot be extended to a global assignment which is compatible 
with the support. This says that the support cannot be covered by the projections of global 
assignments. A stronger form of contextuality is when no global assignments are consistent 
with the support at all. Note that this stronger form does not hold for the Hardy paradox. 

Several much-studied constructions from the quantum information literature exemplify strong 
contextuality. An important example is the Popescu-Rohrlich (PR) box [46] shown in Fig. [5j 

This is a behaviour which satisfies the no-signalling principle [20|, meaning that the prob¬ 
ability of Alice observing a particular outcome for her choice of measurement (e.g. ai = 0), is 
independent of whether Bob chooses measurement 61 or 62 ; and vice versa. That is, Alice and 
Bob cannot signal to one another—the importance of this principle is that it enforces compatibility 
with relativistic constraints. However, despite satisfying the no-signalling principle, the PR box 
does not admit a quantum realisation. Note that the full support of this model correspond to the 
propositions used in showing the contextuality of the Bell table from Fig. [21 and hence the fact that 
these propositions are not simultaneously satisfiable shows the strong contextuality of the model. 
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Figure 5 . The PR Box 
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Figure 6. The Hardy table and the PR box as bundles 


In fact, there is provably no bipartite quantum-realisable behaviour of this kind which is strongly 
contextual [ 29 l| 35 ]. However, as soon as we go to three or more parties, strong contextuality does 
arise from entangled quantum states, as we shall see in 0 

Visualizing contextuality. The tables which have appeared in our examples can be displayed in 
a visually appealing way which makes the fibred topological structure apparent, and forms an 
intuitive bridge to the formal development of the sheaf-theoretic ideas in the next section. 

Firstly, we look at the Hardy table from Fig. 01 displayed as a “bundle diagram” on the left of 
Fig. [6l Note that all unspecified entries of the Hardy table are set to 1 . 

What we see in this representation is the base space of the variables oi, 02 , 61 , 62 - There is 
an edge between two variables when they can be measured together. The pairs of co-measurable 
variables correspond to the rows of the table. In terms of quantum theory, these correspond to 
pairs of compatible observables. Above each vertex is a fibre of those values which can be 
assigned to the variable—in this example, 0 and 1 in each fibre. There is an edge between values 
in adjacent fibres precisely when the corresponding joint outcome is possible, i.e. has a 1 entry 
in the table. Thus there are three edges for each of the pairs {oi, 62}, {02, bi} and {02, 62}- 

A global assignment corresponds to a closed path traversing all the fibres exactly once. We 
call such a path univocal since it assigns a unique value to each variable. Note that there is such a 
path, marked in blue; thus the Hardy model is not strongly contextual. However, there is no such 
path which includes the edge displayed in red. This shows the logical contextuality of the model. 

Next, we consider the PR box displayed as a bundle on the right of Fig. [6j In this case, the 
model is strongly contextual, and accordingly there is no univocal closed path. 
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Figure 7 . The Liar Cycle 


Contextuality, logic and paradoxes. The arguments for quantum contextuality we have discussed 
may be said to skirt the borders of paradox, but they do not cross those borders. The informa¬ 
tion we can gather from observing the co-measurable variables is locally consistent, but it cannot 
in general be pieced together into a globally consistent assignment of values to all the variables 
simultaneously. Thus we must give up the idea that physically observable variables have objective, 
“real” values independent of the measurement context being considered. This is very disturbing 
for our understanding of the nature of physical reality, but there is no direct contradiction between 
logic and experience. We shall now show that a similar analysis can be applied to some of the 
fundamental logical paradoxes. 

Liar cycles. A Liar cycle of length is a sequence of statements as shown in Fig. [71 
For N = 1, this is the classic Liar sentence 

S : 5 is false. 

These sentences contain two features which go beyond standard logic: references to other sentences, 
and a truth predicate. While it would be possible to make a more refined analysis directly modelling 
these features, we will not pursue this here, noting that it has been argued extensively and rather 
compellingly in much of the recent literature on the paradoxes that the essential content is preserved 
by replacing statements with these features by boolean equations [161130115(111511 . For the Liar 
cycles, we introduce boolean variables xi,... ,x„, and consider the following equations: 

Xi — X2, ■ ■ ■ ) Xn—l — XjT,, Xn — ~'Xl . 

The “paradoxical” nature of the original statements is now captured by the inconsistency of these 
equations. 

Note that we can regard each of these equations as fibered over the set of variables which occur 
in it: 

Xi = X2 
X 2 = X 3 

Xn—l — Xji 
Xn = -'Xi 

Any subset of up to n — 1 of these equations is consistent; while the whole set is inconsistent. 

Up to rearrangement, the Liar cycle of length 4 corresponds exactly to the PR box. The usual 
reasoning to derive a contradiction from the Liar cycle corresponds precisely to the attempt to find 
a univocal path in the bundle diagram on the right of Fig. [6l To relate the notations, we make the 


{XI,X2} 

{X2,X3} 

{Xji—l., Xfi} 
{Xn,Xi} 
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following correspondences between the variables of Fig. [ 6 ] and those of the boolean equations: 

Xi ~ 02, X 2 ~ bi, X3 ~ Oi, X4 ~ 62 • 

Thus we can read the equation xi = X 2 as “02 is correlated with 61 ”, and X 4 = -ixi as “02 is 
anti-correlated with 62 ”. 

Now suppose that we try to set 02 to 1. Following the path in Fig. [ 6 ] on the right leads to the 
following local propagation of values: 

a 2 = l'^ 6 i = l'^ai = l '^62 = l'^ffl 2 = 0 
02 = 0'^ 61 = 0-^ 01=0-^ 62 = 0'^ 02 = 1 

The hrst half of the path corresponds to the usual derivation of a contradiction from the assumption 
that Si is true, and the second half to deriving a contradiction from the assumption that Si is false. 

We have discussed a specific case here, but the analysis can be generalised to a large class of 
examples along the lines of [1611501 . The tools from sheaf cohomology which we will develop in the 
remainder of the paper can be applied to these examples. We plan to give an extended treatment 
of these ideas in future work. 

The Robinson Consistency Theorem. As a final remark on the connections between contextuality 
and logic, we consider a classic result, the Robinson Joint Consistency Theorem j^S]. It is usually 
formulated in a hrst-order setting. The version we will use has the following form: 

Theorem 2.1 (Robinson Joint Consistency Theorem). Let Ti he a theory over the language Li, 
i = 1,2. If there is no sentence (j) in LiH L 2 with Ti h ^ and T 2 I— '4>, then Ti U T 2 is consistent. 

Thus this theorem says that two compatible theories can be glued together. In this binary 
case, local consistency implies global consistency. Note, however, that an extension of the theorem 
beyond the binary case fails. That is, if we have three theories which are pairwise compatible, it 
need not be the case that they can be glued together consistently. A minimal counter-example is 
provided at the propositional level by the following “triangle”: 

Ti = {xi fA -.X 2 }, T 2 = {X 2 AA -.X 3 }, Ts = {X 3 AA -.Xi}. 

This example is well-known in the quantum contextuality literature as the Specker triangle |31j . 
Although not quantum realizable, it serves as a basic template for more complex examples which 
are. See [2l[T3ll^ for further details. 

3. Sheaf formulation of contextuality 

In this section we summarise the main ideas of the sheaf-theoretic formalism for contextuality [2] . 
In ^2.31 we saw that logical contextuality can be expressed in terms of a bundle of outcomes over 
a base space of measurements and contexts. This idea can be formalized by regarding the bundle 
as a sheaf. 

For our purposes, it will be sufficient to view the base space as the discrete space on a finite 
set X of variables. In the quantum case, these variables will be labels for measurements. The 
measurement contexts will be represented by a family Ai = {Ci}i£i of subsets of X. These are 
the sets of variables which can be measured together—in quantum terms, the compatible families 
of observables. We assume that Ai covers X, i.e. y] Ai = X] hence we call Ai a measurement 
cover. We shall also assume that Ai forms an antichain, so these are the maximal contexts. We 
also assume that all the variables have the same fibre, O, of values or outcomes that can be assigned 
to them. Such a triple {X,Ai,0) is called a measurement scenario. We define a presheaf of 
sets over V{X), namely £:: U 1 —)• with restriction 

£{U C U'): £{U') £{U) :: s ^ s\u ■ 
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This presheaf £ is in fact a sheaf, called the sheaf of events. Each s G £{U) is a section, and, in 
particular, g G £{X) is a global section. 

Note that a probability table such as the Bell table (Fig. [2]) can be represented by a family 
{pc}c€M with Pc a probability distribution on £{C) = where contexts C corresponds to the 
rows of the table. Similarly, “possibility tables” such as the Hardy model and the PR box (Figs. 0] 
and ED can be represented by boolean distributions. This latter case, with which the logical and 
strong forms of contextuality are concerned, can equivalently be represented by a subpresheaf S of 
T, where for each context C/ C X, S{U) C is the set of all possible outcomes. Explicitly, S is 
defined as follows, where supp (pclunc) is the support of the marginal of pc at U (1 C. 

S{U) := {s G I VC G M. s\unc G supp {pc\unc)} 

Abstracting from this situation, we assume we are dealing with a sub-presheaf 5 of T with the 
following properties: 

(El) 5(C) ^ 0 for all C G M 

(i.e. that any possible joint measurement yields some joint outcome), and moreover that 
(E2) S is flasque beneath the cover, meaning that S{U C U'): S{U') —> S{U) is surjective 
whenever U '^U' C C for some C G Ad, 

which amounts to saying that the underlying empirical model satisfies the no-signalling principle 

m- 

(E3) A compatible family for the cover Ad is a family {sc}c&M with sc G S{C), and such 
that: 

VC, C' G Ad. sc\cnc' = sc'IcnC' • 

We assume that such a family induces a global section in S{X). (This global section must 
be unique, since 5 is a subpresheaf of £, hence separated). 

What these conditions say is that S is determined by its values 5(C) at the contexts C £ Ai, below 
Ad by being flasque, and above Ad by the sheaf condition. 

Definition 3.1. By an empirical model on (X, Ad,0), we mean a subpresheaf 5 of 5 satisfying 

m, (ESI), and (m. 

In [2], we used the term “empirical model” for the probability table {pc}c&M- la the present 
paper, we shall only work with the associated support presheaf 5, and so it is more convenient to 
refer to this as the model. 

We can use this formalisation to characterize contextuality as follows. 

Definition 3.2. For any empirical model 5: 

• For all C G Ad and s G 5(C), 5 is logically contextual at s, written LC(5,s), if s is not a 
member of any compatible family. 5 is logically contextual, written LC(5), if LC(5,s) 
for some s. 

• 5 is strongly contextual, written SC(5), if LC(5,s) for all s. Equivalently, if it has no 
global section, i.e. if 5(X) = 0. 

Note that, for every probability table {pc}ceA4 that satisfies the no-signalling principle, the 
supports of the distributions pc induce an empirical model 5, and therefore logical or strong 
contextuality can be characterized as above. This formulation of contextuality makes it natural to 
use sheaf cohomology, as we will see in ^ 

4. All-vs-Nothing arguments 

Quantum theory provides many instances of strong contextuality. Among the first to observe 
quantum strong contextuality (though not in the general terms that we do) was Mermin [37] . who 
showed the GHZ state to be strongly contextual using a kind of argument he dubbed ‘all versus 
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nothing’. We show in ^4.II that arguments of this type can in fact be used to prove strong contex- 
tuality for a large class of states in quantum theory, particularly in stabiliser quantum mechanics, 
which plays a crucial role in quantum computation. Moreover, in ^4.21 we give a much more gen¬ 
eral formulation of this type of argument that can be used to show strong contextuality for a much 
larger class of models. 

4.1. All -vs-Nothing for quantum theory. The GHZ state is a tripartite state of qubits, defined 
as (Ittt) + |iii))A/2. We assume that each party z = 1,2,3 can perform Pauli measurements in 
{Xi,Yi}, and each measurement has outcomes in O = Z 2 = {0, IjH Then, following Mermin’s 
argument, the possible joint outcomes satisfy these parity equations: 


Xi © 1^2 0 I3 — 0 


hi © X 2 © Is — 0 


Yi © ^2 © As = 0 Xi © A 2 © As = 1 

This system of equations is inconsistent because, regardless of the outcomes assigned to the ob¬ 
servables Ai,... ,© 3 , the left-hand sides sum to 0 (since each variable occurs twice) whereas the 
right-hand sides sum to 1. This shows that the model is strongly contextual, as there is no global 
assignment of outcomes to observables consistent with the observed local assignments. 

The essence of the argument is that the possible local assignments satisfy systems of parity 
equations that admit no global solution. We call this an All-vs-Nothing argument. 

In fact, such arguments arise naturally from a much larger class of states in stabiliser quantum 
theory [42] . Consider the Pauli n-group Vn, whose elements are n-tuples of Pauli operators (from 
{A, Y, Z, /}) with a global phase from {±1, ±z}. 

Definition 4.1. An AvN triple in Vn is a triple (e, f,g) of elements of Vn with global phases +1, 
which pairwise commute, and which satisfy the following conditions: 

(Al) For each z = 1,... , n, at least two of e*, fi, gi are equal. 

(A2) The number of z such that ei = g^ ^ fi, all distinct from I, is odd. 

Mermin’s argument, and the other All-vs-Nothing arguments which have appeared in the liter¬ 
ature, can be seen to come down to exhibiting AvN triples. 

Theorem 4.2. Let S be the subgroup of Vn generated by an AvN triple, and Vg the subspace 
stabilised by S. For every state \ijj) in Vs, the empirical model realised by \ijj) under the Pauli 
measurements admits an All-vs-Nothing argument. 


Proof. Firstly, we recall the quantum mechanics formula for the expected value of an observable A 
on a state v: 


{A)v = 


Note that 


(z;|A|z;) = 1 A\v) = |z;). 

Thus A stabilises the state v iff the expected value is 1. Suppose that A is a dichotomic observable, 
with eigenvalues +1,-1 (see footnote 3), and z; is a state it stabilises. The support of the distribu¬ 
tion on joint outcomes obtained by measuring A on z; must contain only outcomes of even parity; 
while if — A stabilises v, then the support will contain only outcomes of odd parity. If A = Pi • • • 
in Vn, the former case translates into the equation 


Xi © • • • © Xn = 0 


^Although the eigenvalues of the Pauli matrices are +1 and —1, we relabel +1, —1, x as 0, 1, ©, respectively. The 
eigenvalues of a joint measurements A 2 ® A 3 are the products of the eigenvalues of the measurements at each 
site, so they are also ±1. As such, in the usual representation, these joint measurements are still dichotomic and only 
distinguish joint outcomes up to parity. Mermin’s argument shows that this information is sufficient to derive strong 
contextuality. 
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where we associate the variable Xi with Pi] while in the latter case, it corresponds to the equation 

Xi © • • • © Xn = 1. 

If the set of equations satisfied by a state v stabilised by a subgroup S of Vn is inconsistent, we say 
that V admits an All-vs-Nothing argument with respect to the measurements h with global 
phase +1 such that either /i or —/i is in S'. 

We now show that any state v in the subspace Vg stabilised by the subgroup S generated 
by an AvN triple (e, /, g) admits an All-vs-Nothing argument. Firstly, by the algebra of the 
Pauli matrices, we see from (|Aip that if {ei,fi,gi} = {P,Q}, with at least two equal to P, the 
componentwise product eifigi will, disregarding global phase, be Q. By (IA2h . we see that the 
product efg = —h, an element of Vn with global phase —1, which translates into a condition of 
odd parity on the support of any state stabilised by these operators for the measurement h. On 
the other hand, condition dm) implies that under any global assignment to the variables, we can 
cancel the repeated items in each column, and deduce an even parity for h. □ 

If e,f,g have linearly independent check vectors, they generate a subgroup S such that Vg has 
dimension 2”“^ mM- Thus we obtain a large class of states admitting All-vs-Nothing arguments. 

4.2. Generalized All-vs-Nothing arguments. Despite their established use in the quantum 
literature, All-vs-Nothing arguments as considered above do not exist for all strongly contextual 
models. Nonetheless, a natural generalization applies to more models. For instance, the model 
called ‘box 25’ in [35] admits no such (parity) argument, but it still satisfies the following equations, 
in which the coefficients of each variable on the left-hand sides add up to a multiple of 3, whereas 
the right-hand sides do not: 

ao + 26o = 0 mod 3 ai + 2co = 0 mod 3 

ao + bi + cq = 2 mod 3 oq + 6 i + ci = 2 mod 3 

ai + bo + Cl = 2 mod 3 oi + 6i + ci = 2 mod 3 

This example suggests using a general instead of just Z 2 . But once we realize that it is the ring 
structure of which plays the key role, we can obtain an even more general version. 

Fix a rin^ R, and a measurement scenario {X, M, R). 

Definition 4.3. An i?-linear equation is a triple cj) = {C,a,b) with C G A4, a: C —?■ R and 
b ^ R. Write := C. An assignment s G £{C) satisfies (j), written s |= (/>, if 

a{m)s{m) = b . 

m^C 

This lifts to the level of systems of equations, or theories, and sets of assignments, or “models”: 

• A system of equations F has a set of satisfying assignments, M(F) := {s G £{C) | Vi;/) G F. s |= 1 ?!)}. 

• A set of assignments S C £{C) determines an i?-linear theory, Tr{S) :={(/> | Vs G S. s |= i;/)}. 

Definition 4.4. Given an empirical model S, define its i?-linear theory to be 

Tr{S) := U Tr{S{C)) = {,/. I Vs G S{V^). s ^ <P} . 
ceM 

We say that S is AvN^, written AvN7^(5), if ¥^(5) is inconsistent, meaning that there is no global 
assignment g: X —)• R such that V(^ G Tr{S). g\v^ |= 4>. 

Proposition 4.5. An AvN/j model is strongly contextual. 

Proof. Suppose S is not strongly contextual, i.e. that there is some g G S{X). Then, for each 
(f G Tr{S), g\v^ G S{V^), hence g\v^ ^ cj). Thus, Tr{S) is consistent. □ 

‘^All rings considered in this paper will be commntative and with nnit. 
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4.3. AfRne closures. We now consider the relationship between ii-linear theories and empirical 
models more closely. We shall relate the existence of an AvN/j argument to the strong contextuality 
of a derived model. 

First, we focus on a single context, or set of variables, U O X. The maps between theories and 
models, 

T 

V£{U) Theories , 

M 

form a Galois connection, corresponding to the lifting of the satisfaction relation to the powersets: 

S C M(r) iff T(5) D r . 

We consider the closure operator M o T, which gives the largest set of assignments whose theory 
is still the same. First, note that £{U) = BP is a (free) i?-module (and, when R is a field, a vector 
space over R). Given solutions si,..., to a linear equation, an affine combination of them. 


cisi + • • • + ctSt such that ci + • • • + c* = 1 , 

is again a solutiorH. In other words, the set of solutions M(r) to a system of equations F is an 
affine submodule of £{U). This means that 

(1) aff < M o T , 


where aff S stands for the alRne closure of a set S C £{U): 


affS := W 


Co, Si 


. 2=1 


Si G S', Cj G i?, ( 


2=1 



In the particular case of vector spaces (i.e. when i? is a field), ([T]) is in fact an equality: hence, affine 
subspaces are exactly the possible solution sets of a theory, and there cannot exist two different 
affine subspaces with the same theory, as may happen for general rings R. 

We now lift this discussion to the level of empirical models. The natural thing to do is to take 
the affine closure at each context C G A4. However, one must be careful to ensure that this yields 
a well-defined empirical model. Firstly, note that the affine closure operation above is natural on 
U: it gives a natural transformation bR : V o £ —;• V o £, meaning that 


(2) (aff5)|f;. = aff(S|f/0 . 

This follows easily from the fact that the module operations on £{U) are defined coordinatewise. 


Definition 4.6. Let S be an empirical model on the scenario {X, Ai, R). We define its afline 
closure, AfF5, as the empirical model given, at each C G A4, by 

(AfF5)(C) := afF(5(C)) . 

The property in equation ([2]) guarantees that AfF5 can be consistently defined to be flasque 
below the cover as (AfF5)([/) = aff(5(C/)). Note, however, that this equality does not hold for 
U above the cover. In particular, it may happen that S{X) = 0 {S strongly contextual), but 
(Aff5)(X) p 0. 

Since Ti^(5) is given as the union of the theories at each maximal context, the Galois connection 
above lifts to the level of empirical models. Moreover, we also have 

(3) AfF < M o T , 

with equality when ii is a field. 

One can therefore relate the notion of S being AvNjj to the strong contextuality of the affine 
closure of S. 

^AfEneness is required because the equations may be inhomogeneous. 
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Proposition 4.7. Let S be an empirical model on {X,Ai,R). Then, 

AvN(5) ^ SC(Aff5) . 

If R is a field, the converse also holds. 

Proof. From inequality ([3]), Tr(5) = Ti{(Aff5). Hence, if S is AvN/j, then so is Aff5, implying by 
Proposition 14.51 that it is strongly contextual. 

For the converse in the case that i? is a field, suppose that Tij(5) is consistent. This means 
that there is a global assignment g: X —?■ R satisfying all the equations in ¥^(5). But since 
for fields MT/j(5) = Aff5, we have that g G (Aff5)(X), hence the model Aff5 is not strongly 
contextual. □ 


5 . Cohomology witnesses contextuality 

The logical forms of contextuality are characterised by the existence of obstructions to the ex¬ 
tension of local sections to global compatible families. Thus, it seems natural to apply the tools 
of sheaf cohomology, which are well-suited to identifying obstructions of this kind, in order to pro¬ 
vide cohomological witnesses for contextuality. This idea was put forward in previous work by the 
authors [ 5 ], the main points of which we now summarise. In the next section, we shall prove that 
such cohomological witnesses of contextuality exist for the whole class of AvN/j models. 

5 . 1 . Cech cohomology. Let X be a topological space, Ai be an open cover of X, and let 
X: 0 (X)°^ —> AbGrp be a presheaf of abelian groups on X. We shall be particularly concerned 
with the case where X is a set of measurements (a discrete space), and Ai is the cover of maximal 
contexts of a measurement scenario. 

Definition 5.1. A g-simplex of the nerve of A 4 , is a tuple a = {Cq, ..., Cq) of elements of Ai 
with non-empty intersection, |(t| := 0 . We write M{AiY for the set of g-simplices. 

The 0 -simplices are simply the elements of the cover Ai, and the 1 -simplices are the pairs {Ci, Cj) 
of intersecting elements of the cover. Given a q + 1-simplex a = {Cq, ..., Gg+i), we can obtain q- 
simplices 

dj{a) := {Co,...,Cj,..., Cq+i) 0<j<q + l 

by omitting one of the elements. Note that |(t| C \dj{a)\, and so the presheaf X has a restriction 
map 

p\lf^^-.X{\dj{a)\)^Xi\a\) . 

We now define the Cech cochain complex. 

Definition 5.2. For each q >0, the abelian group of g-cochains is defined by: 

C^Ai,X) := n X(|u|) . 

The g-coboundary map, 

5^: C%M,X) C^+^{Ai,X) , 

is defined as follows: for each uj = {ov{t))t£M(M)i ^ C^{Ai,X), and a G M{AiY^^, 

9+1 

i=o 

The augmented Cech cochain complex is the sequence 

0 - ^C^{Ai,X) - ^C^{Ai,X) -. 

Proposition 5.3. For each q, 5^ is a group homomorphism, and we have o cfi = 0. 
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Definition 5.4. For each q >0, we define: 

• the ( 7 -cocycies := keriJ'^; 

• the ( 7 -coboundaries := im(5'?“^. 

By Proposition 15.31 these are subgroups of with B'^(A4,B) C Z^{M.^F). 

Definition 5.5. the g-th Cech cohomology group, is defined as the quotient 

Zi{M,T)/B‘i{M,T). 

Note that B^{M.,F) = 0, and hence H^{M.,F) = Z^{M.,F). A 0-cochain a; is a family {rc E 
F{C)}(j^j^. Since, for each 1-simplex a = (C,C"), 

5°(a;)((j) = rcicnc' - '^(^'Icnc' , 
a; is a cocycle (i.e. satisfies < 5 *^( 0 ) = 0) if and only if 

’’clcnc' = ’’c'icnc' 

for all maximal contexts C, C" E Ad with non-empty intersection. 

This almost states that r is a compatible family, except that it does not require compatibility over 
restrictions to the empty context. For our present purposes, we are only interested in connected 
covers (since one can always reduce the analysis of a scenario to its connected components), in 
which case the exception is irrelevant. This is because, given any two contexts C and C with 
empty intersection, there exists a sequence of contexts 

C = Co, Cl, ..., a = C 

such that Cj n Cj+i 7 ^ 0 for all i. Then, for any i, we have 

rCi\0 = ’’CilCiPiCi+iU = TCi+ilCinCi+iU = rci+i\0 ■ 

Consequently, rc\cnC' = fc \0 = 'f'c '\0 = ^C'lcnCE and so the family is compatible. 

5.2. Relative cohomology and obstructions. In order to solve the problem of extending a local 
section to a global compatible family, we need to consider the relative cohomology of F with respect 
to an open subset U O X. We will assume that the presheaf is flasque beneath the cover (as is the 
case with S). 

We define two auxiliary presheaves related to F. Firstly, F\if is defined by 

F\u{V) :=F{UnV) . 

There is an evident presheaf map p: F —?■ F\ jj given as 

pv : FiV) F{U n P) :: r ^ r\u 0 V ■ 

Secondly, ig defined by Fij{V) := ker {pv)- Thus, we have an exact sequence of presheaves 
(4) 0- ^Fjj - ^F^F\u . 

The relative cohomology of F with respect to U is defined to be the cohomology of the 
presheaf Fjy. 

We now see how this can be used to identify cohomological obstructions to the extension of a 
local section. First, recall that in light of Proposition 15.31 the image of B^[Fi^F), is contained 
in Z^{M.,F). Therefore, the map can be corestricted to a map m: C^(A4,F} —> Z^(A4,F}, 
whose kernel is Z^(A4,F) = H^{M,F) and whose cokernel is Z^{M,F)/B^{M,F) = H^{M,F). 
In summary, we have: 

H^{M,F) C^{M,F) Z\M,F)''-^^H^iM,F) . 
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We now lift the exact sequence of presheaves ([H) considered above to the level of cochains. The map 
C^{M,F) C^{M,Fjy) is surjective due to flaccidity beneath the cover. Putting this together 
with the previous observation, we obtain the diagram below: 


0 -^ C^{M,Ffj) -^ C°(M,F) 


6 ° 


so 


0 -^ Z^(M,F^) -^ Z^(M,F) 


C0(M,Flcf) - -0 

50 

zHm,fIu) 


whose two rows are short exact sequences. The snake lemma of homological algebra says that there 
exists a connecting homomorphism turning the kernels of the first row followed by the cokernels 
of the second into a long exact sequence, as shown in the following diagram. 



We are interested in the case where U is an element Cq of the cover Ai. Then, it is clear that 
Fjco) is isomorphic to F(Co), meaning that its elements are the local sections at Co- 

Definition 5.6. Let Co be an element of the cover At and rg G F{Co). Then, the cohomological 
obstruction of tq is the element 7 (ro) of H^{Ai,F(j^), where 7 is the connecting homomorphism. 

The following justifies regarding these as obstructions. 


Proposition 5.7. Let the cover Ai he connected, Cq G Ai, and vq G F{Co). Then, 7 (ro) = 0 if 
and only if there is a compatible family {rc G F{C)}q^j^ such that rcg = rg. 

Proof. The proof follows from the fact that 7 : H^{Ai,F\co) —> H^{.At,F(j^) is in a long exact 
sequence. Hence, the kernel of 7 is equal to the image of the map H^{Ai,F) —)• H^{Ai,F\co)- 
Recall that, since the cover is connected, the elements of H^{At,F) are the compatible families, 
and observe that this map takes such a family {rc G F(C)}ce.M to the element rc^ G F{Co). 
Hence, its image is exactly those local sections at Cq that belong to such a compatible family. □ 


5.3. Witnessing contextuality. We now apply these tools to analyse the possibilistic structure 
of empirical models. The cohomological obstructions of Dehnition 15.61 would appear to be ideally 
suited to the problem of identifying contextuality. The caveat is that, in order to apply those tools, 
it is necessary to work over a presheaf of abelian groups, whereas we are concerned with S, which 
is merely a presheaf of sets. 

We firstly consider how to build an abelian group from a set. 

Definition 5.8. Given a ring R, we define a functor Fr \ Set —)• i?-Mod to the category of R- 
modules (and thus, in particular, to the category of abelian groups). For each set X, Fji{X) is the 
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set of functions (p: X — )■ R of finite support. Given a function /: X — )• Y, we define: 

Fnf: FrX FrY :: P^Xy. ^ cP{x) . 

f{x)=y 

This assignment is easily seen to be functorial. We regard a function (p G Fr{X) as a formal 
i?-linear combination of elements of X: 

^ ■ 

xGX 

There is a natural embedding x ^ 1- x oi X into Fr(X), which we shall use implicitly throughout. 
In fact, Fr(X) is the free i2-module generated by X; and in particular, Fz{X) is the free 
abelian group generated by X. 

Given an empirical model S defined on the measurement scenario {X, we shall work with 

the (relative) Cech cohomology for the abelian presheaf FrS for some ring R. 

Definition 5.9. With each local section, s € S{C), in the support of an empirical model, we 
associate the cohomological obstruction 'jPusis)- 

• If there exists some local section sq G S{Co) such that 'yFusiso) / 0, we say that S 
is cohomologically logically contextual, or CLCij(5). We also use the more specific 
notation CLCi{(5,so)- 

• If 'YFjiSis) 7 ^ 0 for all local sections, we say that e is cohomologically strongly contex¬ 
tual, or CSCr. 

The following proposition justifies considering cohomological obstructions as witnessing contex- 
tuality. 

Proposition 5.10 ([5l Proposition 4.3]). 

• CLCr implies LC. 

• CSCr implies SC. 

Proof. Suppose an empirical model e is not logically contextual. Then for every maximal context 
Co G A4 and every sq G 5(Co), there is a compatible family {sc G S{C)}c£M with Scq = so. As 
5(C) embeds into FrS{C), {sc} is also a compatible family in FrS. Hence, by Proposition 15.71 
we conclude that 7 ( 5 ) = 0. The same argument can be applied to a single section witnessing the 
failure of strong contextuality. □ 

Thus we have a sufficient condition for contextuality in the existence of a cohomological obstruc¬ 
tion. Unfortunately, this condition is not, in general, necessary. It is possible that “false positives” 
arise in the form of families {rc G FrS{C)}c&m which are not bona fide global sections in 5(X) 
in which genuine global sections do not exist. 

Several examples are discussed in detail in [5]. An example for which a false positive arises 
is the Hardy model |23ll24j . However, cohomological obstructions over Z (in fact, Z 2 is enough) 
provide witnesses of strong contextuality for a number of well-studied models, including: the GHZ 
model [^1221138] . the Peres-Mermin “magic” square [3^1401144 j . and the 18-vector Kochen-Specker 
model m, the PR box [36], and the Specker triangle [311I39|. In fact, the Kochen-Specker model 
and the Specker triangle belong to a large class of models, known as -iGCD. In [5], it is shown all 
the models in this class admit cohomological witnesses for their strong contextuality. This result 
will be greatly generalised in ^ 

The coefficients for cohomology can be taken from any commutative ring R. We now consider 
how the cohomological obstructions obtained with different rings relate to each other. 

Proposition 5.11. Let h: R' —)• R be a ring homomorphism. Then, for any C G A4 and 
s G S{C), 7f^,5('S) = 0 implies ^F^sis) = 0, and so CSCr CSCri and CLCr CLC/j/. 
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Proof. If h: R —)• R' is a ring homomorphism, then for any set X there is a map 

Fh : FrX — Fr^X :: r i—^ hor . 

which is a group homomorphism. Moreover, this assignment is natural in X. Hence, this determines 
a presheaf map FrS —)• Fr/S, and compatible families on the former are mapped to compatible 
families on the latter. Since the map Fh above leaves the elements of the generating set fixed, the 
condition that the family agrees with sq at context Cq is preserved. □ 

We conclude this section with a remark. If {rc G FRS{C)}ceM is a compatible family, then 
the sum of the coefficients of the formal linear combinations rc is the same for all C. This holds 
because 5(0) = 5(0) = {*}; so that for any C G Ai, we have 

rc\0i*) = rcis) ; 

sesic) 

i.e. compatibility forces all these restrictions to the empty context to be the same. Therefore, when 
the obstruction of a section sq G 5(C'o) (more precisely, of the linear combination 1 • sq G FrS{Co)) 
vanishes, the corresponding family of linear combinations {rc G FrS{C)}c£M must in fact contain 
only affine combinations—those whose coefficients sum to one. 

6 . Cohomology and AvN arguments 

The aim of this section is to show that if an empirical model is AvN/^, then the cohomological 
obstructions witness its strong contextuality. Moreover, it is enough to consider cohomology with 
coefficients in the ring R itself. 

The result is stated as follows. 

Theorem 6.1. Let S he an empirical model on {X, Ai, R). Then: 

AvNr{S) ^ SC(Aff5) ^ CSCr{S) => CSCz(5) ^ SC(5) . 

The first of these implications was already established in Proposition 14.71 the third in Proposi¬ 
tion [STlland the fourth in Proposition [STOI 

In order to prove the second, we use the properties of the functor Fr : Set —>• i?-Mod that 
constructs the i?-module of formal i?-hnear combinations of elements of a set X. As already 
mentioned, FrX is the free i?-module generated by X. This means that it is the left adjoint of the 
forgetful functor U: i?-Mod —^ Set. 

Fr 

Set T i2-Mod 

u 

The unit p of this adjunction is the obvious embedding, which we have been using, taking an 
element x G X to the formal linear combination 1 • x. The counit is the natural transformation 
e: Fr o U ^ IdR-Mod given, for each i?-module M, by the evaluation map 

Em : FrU{M) — > M :: r i—^ ^ r(x)x . 

xgM 

We are interested in taking formal linear combinations of subsets of elements. Let us hx a module 
M and a subset S C U{M). Then the map cm, by virtue of being an i?-module homomorphism, 
maps the formal linear combinations of elements of S, Fr{S), which coincide with the linear span 
in FrU{M) of r][S] = {1 • s | s G S}, to the linear span of S in M, span^ S. Moreover, it maps the 
formal affine combinations Fff(5) = aff^^ U{M) vi^] to ttie affine closure affjvf S. 

Recall that we are dealing with measurement scenarios whose outcomes are identified with a 
ring R, hence where £{U) are themselves i?-modules, i.e. 5: V{X)°'^ —)■ R-Mod. As such, the 
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counit can be horizontally composed to yield a natural transformation, or map of presheaves, 
idf * e: Fr o U o £ — > £^ given at each context U F X hy FrU£{U) — > £{U)- Now, 

given an empirical model 5, we can apply the observation regarding subsets of the module at each 
context. But, since affg(f/) 5(C/) = (Aff5)(C/) by definition for U beneath the cover, and since 
containment still holds above it, we conclude that the presheaf map restricts as follows: 

Ff fC/5 >-- FrUS > -- FrU£ 


AfF S> -^ Span 5 >-9- £ 

We can now prove the theorem. 

Proof of Theorem \6.1[ We show the contrapositive. Suppose that S is not CSCr, i.e. that ypRsiso) = 
0 for some sq £ ‘^(C'o). Then, by Proposition 15.71 this is equivalent to the existence of a compatible 
family {rc € FrS{C)}(j^j^ with rc^ = sq. As observed at the end of ^5.31 all these rc must be 
formal affine combinations of elements in S(C). But then the presheaf map F^^US —)• Aff5 above 
pushes this compatible family to a compatible family of AfF 5, implying that the model AfF 5 is not 
strongly contextual. □ 

Essentially the same strategy can be used to prove an analogous result for logical contextuality. 
The notion of inconsistent theory has to be adapted: instead of asking whether there is a global 
assignment satisfying all the equations in the theory, we can ask, given a partial assignment sq € 
£{Co) whether there is such a global assignment with the additional requirement that it restricts to 
sq. This can be seen as a generalisation of the notion of robust constraint satisfaction studied 
in [3] from the complexity perspective. We write AvN/^(e, sq) if the theory of S has no solution 
extending sq. Then we have: 

AvN^Ce, So) => LC(AfF5,so) => CLCr{S,so) CLCz(5,so) =/ LC(5,so) . 

Discussion. We have shown that for a large class of models, their logical or strong contextuality is 
witnessed by cohomology. This subsumes and greatly generalises the results in [S]. Moreover, these 
models include a large class of concrete constructions arising from stabiliser quantum mechanics, 
going well beyond existing results of this kind in the quantum information literature. It remains 
an objective for future work to achieve a precise characterisation of what cohomology detects, and 
more generally full equivalences between the various ways of expressing contextuality. Note that, as 
already mentioned, the first implication in Theorem 16.11 can be reversed under the assumption that 
i? is a field. If we use a more abstract notion of equational consistency, in terms of quotient modules 
rather than equations expressed in a “coordinatized” form, then it can be reversed even for general 
rings. The point of taking the ground ring to be a field is exactly that it allows coordinatization. 

We also remark that the cohomological methods we have developed can be applied to an elabo¬ 
rated version of the treatment of logical paradoxes we gave in ^ following the lines of |16ll50j . We 
aim to give a detailed treatment of this “cohomology of paradox” in future work. We also note the 
intriguing resemblances, on the conceptual level at least, to the work of Roger Penrose in |43| . 
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